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Abstract 

We present an adaptive approach for valuing the European call option on assets with stochas- 
tic volatility. The essential feature of the method is a reduction of uncertainty in latent 
volatility due to a Bayesian learning procedure. Starting from a discrete-time stochastic 
volatility model, we derive a recurrence equation for the variance of the innovation term in 
latent volatility equation. This equation describes a reduction of uncertainty in volatility 
which is crucial for option pricing. To implement the idea of adaptive control, we use the 
risk-minimization procedure involving random volatility with uncertainty. By using stochas- 
tic dynamic programming and a Bayesian approach, we derive a recurrence equation for the 
risk inherent in writing the option. This equation allows us to find the fair price of the Euro- 
pean call option. We illustrate numerically that the adaptive procedure leads to a decrease 
in option price. 

Keywords: Stochastic Volatility, Adaptive Decision Process, Bellman's equation. 



1 Corresponding author. 

E-mail address: sergei.fedotov@manchester.ac.uk 



1 Introduction 



Empirical observations on derivative prices show that implied volatilities vary with strike 
price giving the well known volatility "smile" effect jTj. This suggests that the behavior of 
the asset price, on which the option is written, may be captured by models that recognize 
the stochastic nature of volatility (SV). One can describe the underlying stock by a random 
process that is driven by a random volatility (see, for example, El El and references 
therein). A common feature of these models is that the random volatility is described by 
a random process with known statistical characteristics. The problem with this approach 
is that the volatility is not a tradeable asset, and therefore the classical hedging can not 
be applied. As a result the equation for an option price involves an unknown parameter, 
the market price of volatility risk The difficultly is that this parameter is not directly 
observable and one has to make additional assumptions on the pricing of volatility risk. 
Moreover, in practice it is very difficult to suggest the exact statistics for the volatility in 
advance. A different approach to stochastic volatility has been suggested in [SHE]. Instead 
of finding the exact option price one can focus on the pricing bands for options. The other 
problem of stochastic volatility models is associated with the efficient estimation of the 
unobserved volatility process from financial data. These lead some researchers to accept the 
idea of uncertain volatility when all prices for the option are possible within some range 
[7j. The question arises whether this uncertainty can be reduced during decision making. 
One of the main purposes of this work is to answer this question by using the simplest SV- 
model, the idea of Bayesian learning procedure and adaptive decision process (see jElE])- We 
suppose that some of the statistical properties of volatility are not known initially. Instead, 
we assume that we have an a priori estimation for them. By using the Bayesian approach 
(see [9J), we revise these a priori characteristics of random volatility. To implement the 
idea of adaptive feedback control for option pricing, we use a risk-minimization procedure 
(see and references therein) and stochastic dynamic programming [HI El El- This work 
extends the idea of using adaptive processes in option pricing suggested in [T3J . Application 
of stochastic dynamic programming for pricing of derivatives can be found in [T^l ITS] . It 
should be noted that the Bayesian learning approach to option pricing was also used in 
IT7| ITS! ITU] in different contexts. Bayesian estimation of a stochastic volatility model by 
using option price was proposed in [20]. The outline of the paper is as follows. In Section 2, 
we introduce a discrete-time stochastic volatility model and describe the Bayesian learning 
procedure. We derive the recurrence equation for the variance of the innovation term in 
latent volatility equation. In Section 3, we describe the risk-minimization procedure and 
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derive the Bellman's equation for the risk inherent in writing the option. By using this 
equation we find the fair price of European call option. We illustrate numerically that the 
adaptation procedure leads to a decrease in the option price. 

2 Uncertain Stochastic Volatility and Adaptation 

In this paper we consider a simple market with two traded assets: a riskless bond, B n , and 

a risky asset (stock), S n , evolving at discrete times n = 0,1, N. The bond price B n is 

governed by the recurrence relation 

B n+1 = (l + r)B n , B >0, (1) 

with the constant interest rate r > 0. The stock price S n is governed by the stochastic 
difference equation 

In (^) = Cn, So > 0, (2) 
where the stochastic return £ n is modelled as follows 

£ n = li + (T6 n e h »' 2 . (3) 

Here \x is the mean return from holding a stock at time n, a is the instantaneous volatility, 
h n is the log- volatility (latent volatility) at time n that follows a stationary AR(l)-process: 

h n+ i = tph n + u n , (4) 

where is the "persistence" parameter of volatility: \<p\ < 1. This is the simplest version of a 
stochastic volatility model and gives a discrete-time approximation for standard continuous 
stochastic volatility models (see, for example, |HI2I1)- There are two sources of uncertainty 
in stochastic equations (J3J) and (jljl. namely the innovation terms S n and u n . We assume that 
5 n and u n are both Gaussian sequences of mutually independent random variables, and S n 
has the following probability density function: 

^) = -lp W ,<, } = -I=exp{4}. (5) 

Let us now discuss the statistical properties of u n . The empirical observations suggest that 
the log- volatility can be reasonably approximated by the Gaussian distribution [221 EH 121] • 
Suppose that the investor does not know the exact value of the variance of u n . With enough 
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information from the past history, the investor is assumed to have an a priori value for it, 
such that the probability density function for the first term, u , is 

d 1 f u 2 1 

Pow = ;r p <u \= /k — 2 exp i r • 6 

Our idea is to use an adaptive procedure by which the uncertainty regarding u n can be 
reduced by using the equations (J2J)-®- For this purpose one needs an equation involving 
both random sequences, u n and 5 n in a linear combination. From ©-(jH) one can find 

In S n+ i + h n+ i = In S n + \i + a5 n e hn/2 + ^h n + u n . (7) 

If we start with the given values of So and ho, it follows from (JSJ and (J7J) that the likelihood 
of In Si + hi conditional on u = u is 

rn QMh\\ r i (H%) + h - ^ - V h - u) 2 \ 

L(lnS 1 + = C L exp < >, (8) 

where Cl is independent from u. By using (JHJ) and Bayes' rule 

Pi (« ln Si + hi) = — —— (9) 
J L (In Si + hi\ u ) p {u)du 

(see [9j) one can find a posteriori pdf of u\ conditional on In Si + hi'. 



(hi(||) + hi- /I- (ph -u 



u 2 



p x (u\ \nSi + /ii) = Ciexp { ^ exp <j j , (10) 

where G\ is independent of u. Equation (fTU|) gives the learning procedure that can be used 
at each stage of the process to revise the probability density function for u n . By using (JTU1) 
we can find the recurrence relation for p n (u) : 

2 



(ln(%±i) + h n+ i -p- ifh n 



u 



p n+1 (u) = C n+ iexp{ }P n (u). (11) 



It is a well known property of Gaussian distribution (see jU]) that this learning procedure 
gives a revised probability density function that is also Gaussian: 

„.(„)= -P K <„ } = ^«p{-(!t^l}. ( 12 , 
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The standard deviation a n and the mean m n at successive stages are given by recurrence 
equations 



^2 



e - ' 1 "^ + a 2 

m ™+ 1 = _ h 2^ 2 m « + 2^ 2~h • ( 14 ) 

—inert. + o l at + cr^e"" 1 



At each discrete time n, the uncertainty about the value of u n is described by the probability 
density function p n {u) given by (j!2)) which is completely specified by the mean value m n 
and the standard deviation a n . These state variables are the sufficient statistics, and their 
transformation from on stage to the next is given by equations (fT5j) and ([14)1 . Equation (JT5j) 
shows that at every stage 

2 ^2 
°n+l < a m 

that is, the variance of u n+ \ is smaller than the variance of u n . In other words the uncertainty 
about the innovation u n is reduced at every stage n. This is crucial for option pricing. Let us 
note that although a\ — > as n — > oo, stochastic volatility does not disappear overall. Now 
we are in a position to apply the adaptive procedure ()13|) for the pricing of an European call 
option. 



3 Adaptive stochastic optimization 

Assume that an investor sells a European call option with strike price X for Co and invests 
the money in a portfolio containing A shares and 9$ bonds. The investor is concerned with 
hedging this position. It is well known that in incomplete markets a portfolio replicating the 
payoff of the option ceases to exist. Therefore the investor tries to find a trading strategy 
that reduces the risk of an option position to some intrinsic value. The value of the portfolio 
V n at time n is given by 

V n = A n S n + 9 n B n , Vo = C . (15) 
Using the self-financed trading strategy condition 

(A n+ i — A n )S n+ i + (0 n +i — 9 n )B n+ i = 0, 

one can obtain an equation for V n : 

h n-l 

V n = (1 + r)7 ft _! + A^e*^- 16 ^ - - 1 - r)S n ^. (16) 

Let us recall the theory of risk-minimization in option pricing that was developed in [23 
I27| (see also (21312111 EH!)- The investor's purpose is to choose a trading strategy {A , .., Aat_i} 
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such that the terminal value of the portfolio, Vn, should be as close as possible to the options 
payoff: (Sjv — X, 0) + . Thus, the expected value of their difference, under the "real-world" 
probability measure, must be equal to zero: E{(5jv — X, 0) + — Vn} = 0, while the variance 

R = E{((S N -X,0) + -V N ) 2 } (17) 

as a measure of the risk should be minimized. Let us consider the problem of minimizing 
the risk function R for an iV-stage process, starting from the initial states 

S = S, V = V, h = h (18) 

with a priori probability density Po(u) specified by the mean m = 0, and the standard 
deviation a = o u . Here we use a stochastic programming procedure proposed in jTSj. Let 
us introduce the minimal risk 

R N (S,V,h,a u )= min E{((S N - X, 0)+ - V N ) 2 } (19) 

Ao, An-i 

that can be achieved by starting from the initial state ()18j) with a priori pdf po(u). After 
the first decision A = A of the iV-stage process we have 

Sl = Se" +a5eh/2 , (20) 
V x = (l + r)V + A(e» +aSeh/2 - 1 - r)S, (21) 
h 1 = (ph + u 1 (22) 

a\ = ff u - . (23) 

The principal of optimality yields the general functional recurrence equation (Bellman's 
equation) 

R N {S,V,h,a u ) = mmE| J R 7V -i(^ +<T<5e ' l/2 , (24) 



1 + r)V + A (e^ h/2 - 1 - r)S, V h + u, ° °» - )}. 



(see [HI HH H21)- By using the explicit expressions for cp(5) and po(u), the equation can 
be rewritten as follows 



r -i roc poo 

R N (S, V, h, a u ) = mini / / R^Se^ 6 ^ 2 

A I 2tto~ u J_ 00 J_ 00 



2 2 

'l + r)V + A(e> 1+aSe 1 -l-r)S,tph + u, ho u - )e 2 ^dSdu\. (25) 

e~ h af, + a 2 ) 
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To solve ()25|). we need to know the value of the risk function Rn(S, V, h, a u ) for iV = 1. It 
follows from (|17|) that 

Rx(S, V, h) = minE {((Se^ 6 ^ 2 - X, 0)+ - (1 + r)V - A(e» +aSeh/2 - 1 - r)S) 2 } . (26) 
Using ((SJ) we have 



R 1 (S,V,h)=wm\-^= f 

A ^ V 27T J_ ( 



((Se!** 6 *' 2 - X ,0)+ 

3 

- (1 + r)F - A(e^ +,j5e ' l/2 - 1 - r^V^cW}, (27) 



where 

(Se^ Se f -X,0) + = \ 1 ^ . (28) 

L otherwise 

The integral in ([2*7]) can be evaluated exactly (see Appendix A). This allows us to find the 
explicit expressions for the optimal policy, Ai(S,V,h) and the risk R\(S,V,h) when there 
is one stage-to-go (see Appendix A). Now putting iV = 2 in equation (J25J) and using the 
expression for Ri, one can find the risk function R 2 : 



- i /*oo poo 

R 2 (S,V,h) = mini / / RASe^ 5 ^ 2 

(l + r)F + A{e» +a5e ' -l-r)S,<ph + u)e 2 ^dtfduj 

Note that at each stage, the risk function R n does not depend on the mean m n . What 
is more, adaptation procedure starts only at the third step, when the risk f? 3 becomes a 
function of a u that is R 3 = R 3 (S,V,h,a u ). The procedure can be repeated any number of 
times to give the solution of the problem for any value of N. The attractive feature of this 
algorithm is the simplicity with which the adaptation procedure can be applied. The initial 
investment V determining a fair option price C — V can be obtained from the equation 

dR N (S,V,h,a u ) _ 

dV " °" (29) 

In particular, for a one stage process (N = 1), after minimization, we obtain 

S(l -2r + 2fi) + + eV - \n(j))(2Af(d) - 1)) , (30) 
where M{d) is the cumulative distribution function for a Gaussian variable: 



1 f d _s 2 

N(d) = —= \ e ~ds, d 

V27T J-oo 



(// + eV - ln(f ))e~ h / 2 
a 



The above results can be compared to those corresponding to the standard model without 
an adaptive procedure. In the later case, an a priori density function for u © is kept at 
each stage, and the risk function becomes the function of S, V, and h only. The Bellman 
recurrence equation for the risk minimization problem is then given by 

, i poc roc 

R N (S, V, h) = mini / / R N ^{Se^ a5eh/ \ 

2 2 

(1 + r)V + A{e^ +aSeh/2 - 1 - r)3, <ph + u)e~^~^d8du}, (31) 

where R\ is the same as (|27jl. To illustrate our adaptive control method we value the 
European call option with the strike price X = 50, the initial log- volatility value h = 0.1, 
the interest rate r = 0.05, the expected return \i = 0.1, the volatility parameter a = 0.2, 
the maturity of the option T — 1, and (p = 0.1. We also calculate the option price for the 
constant volatility case {h — 0). Figure 1 shows the results for the option price as a function 
of S for different number of steps of the adaptive (learning) procedure. To illustrate the 
usefulness the adaptive approach, we computed the value of a European call option for the 
standard (no-learning) procedure using equation (|3*T|) . In Figure 2 we show the difference 
between the option prices with and without adaptation. The number of steps iV = 12. It 
is clear that the adaptive procedure leads to a decrease in option price. Let us now discuss 
the "smile" effect arising from the above methodology, that is, implied volatility varies with 
strike price. Using equations (24) and (29) for iV = 12, we can retrieve various call option 
prices, C a d ap {X), for different strike prices X. Implied volatility I = o~(X), can then be 
computed by inverting the formula 

Cbs{X,I) = C a dap(X), (32) 

where Cbs is the usual Black-Scholes price. We plot our results in Figure 3, where we 
get a "smile" curve, with a minimum near at-the-money. Thus, the adaptive methodology 
presented here is shown to reproduce the "smile" effect observed in the market. There is 
a one-to-one relationship between the volatility "smile" and the implied distribution. The 
volatility "smile" in Figure 3 implies some kurtosis and skewness in the implied distribution. 

4 Conclusions 

In contrast to most stochastic volatility models we applied an adaptive control procedure 
which allows us to revise the stochastic characteristics of latent volatility during decision 
making. We assumed that the statistical properties of an innovation term in a log-volatility 
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equation are not known initially, but instead we have an a priori estimation for them. By us- 
ing Bayesian analysis, we derived the recurrence equation for the variance of innovation term. 
This equation describes a reduction of uncertainty about volatility which is crucial for op- 
tion pricing. We implemented the idea of adaptive procedure by using the risk-minimization 
analysis and stochastic dynamic programming. We showed that the adaptation leads to a 
decrease in the option price compared to the standard models without learning. The adap- 
tive algorithm allows the investor to hedge his position in a consistent way between two 
extremes: a completely uncertain volatility and an ideal situation of known volatility. Of 
course this paper leaves many open questions how to model the uncertainties in a stochastic 
volatility setting. For example, one can introduce the uncertainty in the stochastic return 
(J3j) rather than in the latent volatility (jlj. We hope to address these questions in the future 
works. 



9 



Appendix A 

To evaluate the integral in (J2*7|) . we need to split it into two integrals. That is, 

1 /.<T- 1 e- h / 2 (ln(XS'- 1 )-/t) 2 

R 1 (S,V,h)=wm{-= / {-(l + r)V-A{e^ +a8eh/2 -l-r)S) 2 e-^d5+ 

^ ^ V 2% J —oo 

1 /"°° / \ 2 

-= / (Se^ +t7Seh/2 -X-(l + r)V- A(e^ 5eh/2 - 1 - r)S) 

e-^dSy (A-l) 

By using Mathematica one can get the following expression for Ri : 

R 1 (S, V, h) = minl^L (2e» + ^S{l + A)(-2(l + r)V + hi Aa+ 
5(l+/i+A(eV-2r+ / i)))a+((K+ln(^)+5(-l+rA-(l+A) / x)) 2 +e^(l+A)V)Ar( C /)) + 
^= (( (-2e^^SA(-2(l + r)V + A(ln(|) + S(-l + 

v^tt(((1 + r)V + 5A(/x + eV 2 )) 2 + e fe S 2 AV)(2 - 2jV(d))) (A-2) 

where 

, * / 1 J (^ + eV 2 -ln(f))e~ ft /2 eV 
Af(d) = -7= / e ^ds, d = — , I = ll + 



(A/"(ef) is the cumulative distribution function for the normal distribution). Differentiation 
with respect to A leads to the optimal first decision when there is one stage-to-go, Ai(S, V, h), 
starting from the initial state S and V: 

Ai(S, V) = (e- i (25(e^ + ^5(r-^)ae h / 2 -e i v / 27((r- /U )((l + /U )V r + ln(^)- / u + eV 2 ) 

e h Sa 2 )^ + 2e l y^S((r - /i)(S - ln(^) + Sli) - e h Sa 2 ){2Af{d) - 1))) 

(V2^S 2 ((r 2 + e h a 2 ))- 1 1 

and substituting this in the expression for R\(S, V, h) gives 
Ri(S,V,h) = -^-=(2e tl+e± ^S{{l + r)V - ln(^) + 5(1 + 2^)a+ 



e l V2^)(V + rV + In A + + r + Li)) 2 + e h S 2 {l + r) V+ 

o 

e^(((l + r) V + S(r - /i)) 2 + e h S 2 a 2 + ((S - ln(^)) 

(-2(1 + r)V - ln(^) + e h/2 S(l - 2r + 2/i)) + e^ 2 (l + a 2 )2N(d) 
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Figure 1: Option price as a function of asset price for different number of stages of the adaptive 
process. 




Figure 2: Comparison of option price with and without adaptation for N = 12. 
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Volatility 'smile' for the adaptive procedure, N=12 
0.5 n 



0.4 




~ 0.2 



0.1 



20 40 60 80 

K 

Figure 3: Implied volatility I = cr(X) as a function of strike price X. The constants are taken as 
in Figures 1 and 2. 
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